Extracting Policy Positions from Political Texts Using Words as Data
نویسندگان
چکیده
We present a new way of extracting policy positions from political texts that treats texts not as discourses to be understood and interpreted but rather, as data in the form of words. We compare this approach to previous methods of text analysis and use it to replicate published estimates of the policy positions of political parties in Britain and Ireland, on both economic and social policy dimensions. We “export” the method to a non-English-language environment, analyzing the policy positions of German parties, including the PDS as it entered the former West German party system. Finally, we extend its application beyond the analysis of party manifestos, to the estimation of political positions from legislative speeches. Our “language-blind” word scoring technique successfully replicates published policy estimates without the substantial costs of time and labor that these require. Furthermore, unlike in any previous method for extracting policy positions from political texts, we provide uncertainty measures for our estimates, allowing analysts to make informed judgments of the extent to which differences between two estimated policy positions can be viewed as significant or merely as products of measurement error.
منابع مشابه
Scaling Policy Preferences from Coded Political Texts
Scholars wanting to estimate substantive quantities of interest, for example policy positions, from political texts typically apply a coding scheme to discrete text units such as words or sentences. Scales of policy positions, for example a left-right scale of economic policy, are typically built from the relative frequencies of text units coded into different categories. In this paper we reexa...
متن کاملEstimating Policy Positions from Political Texts
D eriving reliable and valid estimates of the policy positions of key actors is fundamental to the analysis of political competition. Various systematic methods have been used to do this, including surveys of voters, politicians, and political scientists, and the content analysis of policy documents. Each method has advantages and disadvantages but, for both theoretical and pragmatic reasons, p...
متن کاملA Dynamic State-Space Model of Coded Political Texts
This article presents a new method of reconstructing actors’ political positions from coded political texts. It is based on a model that combines a dynamic perspective on actors’ political positions with a probabilistic account of how these positions are translated into emphases of policy topics in political texts. In the article it is shown how model parameters can be estimated based on a maxi...
متن کاملScaling Policy Positions From Coded Units of Political Texts∗
Applying a coding scheme to discrete text units has long been the most common method for estimating substantive quantities of interest about the authors of these texts, whether for political, social, economic, or other substantive reasons. In political analysis, researchers typically build scales of policy positions from the relative frequencies of text units coded as left versus right policy c...
متن کاملTreating Words as Data with Error: Uncertainty in Text Statements of Policy Positions
Political text offers extraordinary potential as a source of information about the policy positions of political actors. Despite recent advances in computational text analysis, human interpretative coding of text remains an important source of text-based data, ultimately required to validate more automatic techniques. The profession’s main source of cross-national, time-series data on party pol...
متن کامل